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HIGH DENSITY SIGNAL CLASSIFIER FOR MEDIA GATEWAYS 



RELATED APPLICATION 

This application is a continuation-in-part of co-pending Patent Application No. 
09/696,730, by Tanrikulu, et al, filed October 25, 2000, entitled "Method and 
5 Apparatus for Performing High-Density DTMF, MF-Rl, MF-R2 Detection", the entire 
teachings of which are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Today's telephony networks are being employed to support many different 

communication devices using many different communication protocols. A typical 
1 0 office, for example, is equipped with electronic communication equipment, such as 

touch tone telephones, facsimile machines, and modems for Internet communications. 

The office communication equipment communicates with a device at a central office, 

for example, which, in turn, facilitates communications between the office 

communication equipment and a node located elsewhere on a POTS (Plain Old 
15 Telephone System) network, the Internet, or other electronic network. Service providers 

maintain, upgrade, and profit fi-om the communication services provided by the central 

office commimication equipment. 

Service providers are limited in the amount of space available where the 

equipment is located, which is typically a central office. Increasing the number of voice 
20 line channels capable of being supported by a media gateway is important to service 



2376.2003-000 



-2- 



providers because service providers generate revenue as a function of the number of 
voice channels that can be supported by their equipment. 

A device at the central office that is used to provide communication services for 
the office communication equipment is known as the media gateway. Among several 
5 services provided, the media gateway connects the office communication equipment on 
the POTS network to another network, such as the Litemet. 

Another service provided by the media gateway is encoding voice signals into 
voice-over-EP (VOIP) packets, where IP means Internet Protocol. Yet another service 
provided by the media gateway is detection of the several conmiunication protocols 
10 used by the office communication equipment. 

A digital signal processor is a device in the media gateway that processes 
communication signals carried by the voice chaimels. Digital Signal Processors (DSP's) 
are sophisticated computer chips that efficiently execute many low-level mathematical 
processing functions. For instance, addition and multiplication functions are executed 
15 in parallel to perform both fimctions in one clock cycle, thereby increasing the 
processing speed (i.e., throughput) of voice channel communication signals. By 
increasing the processing speed, many voice channels can be serviced by a single DSP 
and, therefore, less physical space is occupied by equipment capable of supporting more 
channels. 

20 The DSP is also capable of executing high-level fimctions, such as signal 

classification, in which a signal classifier distinguishes communication protocols fi*om 
one another. The classifier detects sinusoidal signals (sinusoids) composing the several 
communication protocols during the diahng portion of the commimication protocols. 
For example, the dual-tone of dialed digits firom a touch tone telephone (i.e., DTMF - 

25 Dual Tone Multiple Frequency) have sinusoids of different fi-equencies fi-om those of 
other protocols (e.g., MF-Rl - Multiple Frequency, One Row, a digital protocol used 
between digital networking devices). 

Classifiers are traditionally processing intensive, time consuming techniques 
when executed in software on a signal processor, using several (e.g., five) MIPS 
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(Million Instructions Per Second) or more to classify signals. Since DSP's are generally 
only capable of two hundred MIPS, the DSP's can only support a few classifiers since 
many more functions and general overhead processes are required to be done by the 
DSP. Thus, the media gateways must employ many DSP's to support the high numbers 
5 of customers using digital telephony equipment. The number of customers are expected 
to increase as other telephony services, such as VOIP, become more widely used- 

SUMMARY OF THE INVENTION 

The number of channels a media gateway can service is a function of the 
processing performance and memory requirements of a process employed to distinguish 

10 a given communication protocol from among several communication protocols. The 
processing requirements to distinguish the several commxmication protocols have 
traditionally been high. Since the number of processors in media gateways is limited, 
the number of media gateways required to service customers is many. The problem is 
that, while service providers use media gateways as revenue producing devices, the 

15 service providers have limited space in locations, such as central offices, in which to 
house the media gateways. Therefore, it is advantageous to minimize the amount of 
processing used to distinguish the several communication protocols. 

A classifier using the principles of the present invention uses less processing 
than existing classification methods by about an order of magnitude to distinguish a 

20 given communication protocol from among several communication protocols, which 
ultimately reduces the number of media gateways necessary to support the same number 
of customers. 

According to the principles of the present invention, the classifier decomposes a 
subject signal into subbands. The classifier then determines a presence of energy in the 
25 respective subbands, where the energy corresponds to at least one sinusoidal signal 
(sinusoid) in the subject signal. The classifier then filters the at least one sinusoidal 
signal to classify the signal as using a particular communication protocol. 
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In one embodiment, the classifier decomposes the subject signal into subbands 
by employing power symmetric, infinite impulse response (PS-HR) filters. A 
preclassifier may be employed to estimate a frequency of a given sinusoid. Estimating 
the frequency of the given sinusoid may include modeling the given sinusoid by solving 
5 a second-order, auto-regression equation. Alternatively, estimating the frequency of the 
given sinusoid may include and be improved by accessing a look-up table having pre- 
determined ranges of data corresponding to the frequency of the given sinusoid. In the 
look-up table, the ranges of data may be adjusted to account for estimation error due to 
finite signal length. Prior to estimating the frequency of the given sinusoid, the 
1 0 classifier optionally multiplies the subband with a window function, such as an N-point 
triangular window. 

The classifier classifies (i.e., identifies) the signal as being one of the following 
signal types: DTMF, MF, ANS (V.25), LEC_DIS, V.21, or AA, for example. When 
determining a presence of energy in the respective subbaads, the classifier narrows 
15 possibilities as a function of the presence, or absence, of energy in the respective 

subbands. The classifier may instantiate notch filters at select frequencies estimated to 
be frequencies of sinusoids in the signal to fiirther narrow classification possibiHties. 

The classifier decomposes the subject signal to significantly reduce the 
bandwidth of the subbands compared to the subject signal Li one embodiment, the 
20 classifier processes the subject signal in frames having a pre-determined frame size. 
One such frame size has been determined to be 10 miUiseconds in length for sufficient 
estimation accuracy. 

Before reporting the subject signal-type, the classifier may require multiple (e.g., 
three) consecutive classifications identifying the same signal-type. After achieving the 
25 threshold of consecutive classifications, the classifier reports the communication signal 
as being a facsimile, modem, voice, or DTMF signal. 

By spUtting the subject signal into subbands, thereby reducing the bandwidth of 
the subject signal into lower-bandwidth subbands, and through the use of efficient filters 
such as PS-IIR filters, the classifier uses an order of magnitude fewer digital signal 



processor (DSP) instruction cycles (e.g., resulting in less than 0.5 MIPS) below 
traditional classifiers. In this way, many classifiers and dialed-digit detectors may be 
disposed on a single digital processor to support many telephony voice channels. The 
classifier herein described is useful in inter-networking devices, such as a media 
gateway, to facilitate many telephony services, such as VOIP. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects, features and advantages of the invention will be 
apparent jfrom the following more particular description of preferred embodiments of 
the invention, as illustrated in the accompanying drawings in which like reference 
characters refer to the same parts throughout the different views. The drawings are not 
necessarily to scale, emphasis instead being placed upon illustrating the principles of the 
invention. 

Fig. 1 is a block diagram of a network having a media gateway in which the 
present invention signal classifier may be deployed; 

Fig. 2 is a block diagram of the media gateway of Fig. 1; 

Fig. 3 is a schematic diagram of a Jfront end of the signal classifier of Fig. 2; 

Fig. 4 is a schematic diagram of a band splitter in the front end of Fig. 3; 

Fig. 5 is a schematic diagram of a polyphase implementation for the band splitter 
of Fig. 4; 

Fig. 6 is a schematic diagram of the signal classifier of Fig. 2 having eight, 
constantly operating, parallel notch filters; 

Fig. 7 is a schematic diagram of the signal classifier of Fig. 2 having 
preclassifiers to reduce the number of notch filters; 

Fig. 8 is a plot used by the classifier of Fig. 7 for classifying signals; 

Fig. 9 A is a plot of an estimate of a 94 1 Hz signal of a DTMF signal by the 
Preclassifier of Fig. 7; 

Fig. 9B is a plot of an estimate of a 770 Hz signal of a DTMF signal by the 
Preclassifier of Fig. 7; 
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Fig. 10 is a plot of MF-Rl frequencies estimated by the Preclassifier in subband 
1 versus decision boundaries for 10 msec frame sizes used by the classifier of Fig. 7 
using windowing; 

Fig. 1 1 is the plot of Fig. 10 used by the Preclassifier without windowing; 
5 Fig. 12 is a flow diagram of an embodiment of the process of the classifier of 

Fig. 7; 

Fig. 13 is a flow diagram of an embodiment of DTMF detection logic used by 
the process of Fig. 12; 

Fig. 14 is a flow diagram of an embodiment of MF-Rl detection logic used by 
10 the process of Fig. 12; 

Fig. 15 is a block diagram of a V.21 detector executing in the process of Fig. 12; 

Fig. 16 is a plot of the output from the low pass filter (LPF) of the V.21 detector 
of Fig. 15; 

Fig. 17 is a plot of a signal having a phase reversal detected by a detector in the 
15 process of Fig. 12; 

Fig. 1 8 A is a plot of phase reversal detection identified by accepted tests by the 
detector in the process of Fig. 12; and 

Fig. 18B is a plot of a phase reversal detection identified by rejected tests by the 
detector in the process of Fig. 12. 

20 DETAILED DESCRIPTION OF THE INVENTION 

A description of preferred embodiments of the invention follows. 
Fig, 1 is a block diagram of an example network 100 in which an embodiment of 
the present invention may be deployed. The network 100 includes a first network 
device 105 and a second network device 110. The first and second network devices 
25 1 05, 1 10 may be telephones, facsimile machines, computers with modems, servers, 

routers, gateways, other network devices or combinations thereof. The first and second 
network devices 105, 110 are coupled to a gateway 1 15 in a manner facilitating 
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electronic commumcations. The communications are conducted via wire, wireless, or 
fiber optic communication media/protocols. 

The gateway 1 15 is coupled to and provides communication with network 
devices in other networks. As shown, the gateway 1 15 is coupled to a Plain Old 
5 Telephone System (POTS) network 120, hitemet Protocol (IP) network (e.g., the 
Litemet) 125, and other networks 130. 

The gateway 115 provides mechanisms for communications between networks 
of different protocols. For example, the first network device 105 is able to 
communicate with a network device on the POTS network 120 via the gateway 115. 

10 Fig. 2 is a block diagram of a subset of components in the gateway 115. The 

components include an interface 205, processor 210, and memory 220. The gateway 
115 may include many components 205, 210, 220 to support parallel or distributed 
processing to support many users at a time. 

Inside the processor 2 1 0 is a representation of a signal classifier 215. In the 

15 preferred embodiment, the signal classifier 215 is software executing on the processor 
210. The signal classifier (CLS) unit 215 has the task of detecting several tonal 
signals that arise in telephony. The classifier 215 offers a firamework where several 
signal detection tasks, such as DTMF, MF-Rl, ANS, AA, V.21 and Line Echo 
Canceller Disable (LEC_DIS), are addressed. This fi*amework is general enough to be 

20 expanded to other detection tasks and other tones as specified in the fixture. As it stands, 
the classifier 215 is designed for media gateways; however, it can be configured to 
detect only one or a subset of the features listed above and used for other applications as 
well. The classifier 215 first tests each signal type individually according to the 
individual communication protocol standards or recommendations. Furthermore, the 

25 same process is repeated where multiple detectors are enabled. 

The design philosophy targets maximum density independent of the platform of 
implementation. Minimum instruction cycles and memory usage is achieved through 
Digital Signal Processing (DSP) techniques described hereinbelow. 
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The classifier 215, using the DSP techniques according to the principles of the 
present invention, reduces the milhons of instructions per second (MIPS) consumed by 
DSP's in classifying signals in two ways. First, the classifier 215 improves the 
processing of a subject communication signal. Second, the classifier 215 improves the 
5 filtering structures used to process the subject communication signal. By approaching 
the problem of classifying the subject communication signal in terms of processing 
techniques and processing tools, the MIPS used to classify a given communication 
signal are reduced an order of magnitude over traditional methods (e.g, from 5 MIPS to 
0.5 MPS). This processing reduction translates into more channels per DSP, which 

1 0 ultimately results in a reduction of cost per channel for the service provider. 

In terms of signal processing, the classifier 215 processing the communication 
signal takes a two-fold approach at reducing the complexity of the processing. First, the 
communication signal is decimated to 4iaiz and then split into subbands, which 
reduces the bandwidth requirements for the classifier 215 from 8 kHz to 2 kHz. 

15 Second, rather than solving trigonometric equations, the classifier 215 may use pre- 
calculated possible solutions in an accessible database since DSP's execute look-up 
statements in fewer instruction cycles than trigonometry statements (e.g., arctangent). 
In such an embodiment, the classifier 215 may employ a Preclassifier (discussed later in 
detail in reference to Fig. 7) to execute the processing and look-up statements. Thus, 

20 the classifier's approach to signal processing saves several factors of MIPS by this two- 
fold approach at reducing the complexity of the processing. 

In terms of filtering structures, the classifier 215 also uses a two-fold approach. 
First, the classifier 215 preferably employs power symmetric, infinite impulse response 
(PS-IIR) filters where possible. For example, PS-IIR filters are used to perform the 

25 band-splitting of the communication signal. Because PS-ICR filters have reduced 
complexity and increased efficiency, among other advantages, over finite impulse 
response (FIR) filters and other forms of IIR filters, the classifier's efficiency is 
improved. 
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Second, the classifier 215 optionally employs a preclassifier, which determines a 
frequency of a sinusoid in a given subband. By knowing the frequency of the sinusoid 
in the given subband, the classifier 215 can instantiate a notch filter at a select 
frequency, corresponding to the frequencies of interest, as in DTMF, MF-Rl, ANS, AA, 
5 etc. The use of a Preclassifier is discussed in co-pending U.S. Patent Application No. 
09/696,730 filed October 25, 2000, entitled "Method and Apparatus for Performing 
High-Density DTMF, MF-Rl, MF-R2 Detection" by Tanrikulu et al, the teachings of 
which are incorporated herein by reference in their entirety. So, rather than 
continuously running notch filters for all possible frequencies of sinusoids, as discussed 

10 in one embodiment of Tanrikulu et al, for all communication protocols that the 

classifier 215 maybe called upon to classify, the classifier 215 can instantiate one or 
two notch filters, as necessary, for the given communication protocol Therefore, since 
each notch fdter consumes instruction cycles, a reduction from 8 notch filters or more to 
2 or 1 notch filters saves MIPS, which are advantageously used by the DSP for other 

1 5 processing, including the running of additional classifiers for servicing other voice 
channels. 

Beyond the reduction in complexity to yield a reduction in instruction cycles and 
MIPS, the classifier 215 improves robustness of classifying signals in at least four ways. 
First, windowing the signal for small samples of data improves parameter estimates 

20 (i.e., parameters used to determine frequencies of sinusoids) and extends the frequency 
range of parameter estimates to the fiiU range of possible frequencies of sinusoids. An 
N-point triangular window provides good experimental results for the worst performing 
frequencies. Second, estimated correlation values are averaged across frames of data so 
that variational effects are minimized. 

25 Third, in one embodiment, noise effects expected to be found on voice lines are 

pre-calculated by using noise modehng algorithms in conjunction with a Monte Carlo 
simulation, in which noise levels, w(n), and sample sizes, N, are varied. The noise 
effects are then included in the databases as "biased estimates." Fourth, frame sizes are 
optimized to reduce anomalies at low and high frequencies. In the communication 
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protocols of interest, thirty milliseconds of frames are typical. To certify a signal 
classification, multiple consecutive classifications may be used as a threshold, such as 
three classifications indicating the same protocol, so as not to produce false alarms. 
When classifying communication protocols of interest, tests have shown that processing 

5 a ten millisecond frame size works well, providing better results than processing a five 
miUisecond frame size. 

The following detailed description, tables, and figures provide details of the 
general description of the classifier 215 just presented. 

Fig. 3 is a schematic diagram of an embodiment of the front-end processing 300 

10 performed in the classifier 215 at any instant in time. The input to the front-end 

processing 300 is the incoming signal from the line sampled with an A/D (not shown) at 
8KHz. This signal is decimated by a factor of 2: 1 by a decimator 305, which alters the 
sampling rate to 4KHz. Although this straight decimation operation introduces aliasing, 
the ahasing is insignificant for detection of tonal signals. Ahasing manifests itself as a 

1 5 change of the frequency of the tone to be detected. 

After tiie decimation, the signal is passed through a High-Pass Filter (HPF) 310 
to remove any DC or very low frequency noise signals on the line. In another 
implementation, the HPF 310 can be appUed on the subband zero signal (Subg) Sq, 
shown in Fig. 3 after a subsequent band-spUtter 315. While these alternatives serve the 

20 same purpose, there could be small differences in round-off noise in a finite-precision 
implementation. The HPF 310 has the input-output transfer function 



HPF(z) = 

2 l-a z 



25 



where a <1.0. 

The high-pass fiftered signal is input to the band-sphtter 315, which performs a 
band-split operation, hi one embodiment, the band-splitter 315 performs subband 
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decomposition by employing Power-Symmetric IIR (PS-IIR) filters. An embodiment of 
the band-splitter 315 is shown in detail in Fig. 4. 

In Fig. 4, the first-order all-pass sections 415 have the transfer function 

These are implemented through the compact realizations of all-pass networks (see S.K. 
Mitra and K. Hirano, "Digital All-pass Networks", IEEE Trans. On Circuits and 
'=} Systems, vol. CAS-21 , no. 5, pp. 688-700. Sept. 1974; R.A. Valenzuela and A.G. 

m Constantinides, "Digital Signal Processing Schemes for Efficient Interpolation and 

ft 10 Decimation", IEEE Proc. Vol. 130, no. 6, pp. 225-235, Dec. 1983; and 0. Tanrikuhi, 

, Adaptive Algorithms for Accelerated Convergence and Noise Immunity, PhD. Thesis, 

' '■■ Imperial College, London, UK, 1995, the teachings of all are incorporated herein by 

r : reference in their entirety) that use a single multiplier and a single delay. The scahngs 

P ■ 410 by 0.5 are to keep the signal level at unity. After the two paths are combined by 

1 5 adding (step 420a) and subtracting (step 420b) the outputs of the all-pass sections, the 
t , subband signals Subg (So) and Sub, (Sj) have a sampling rate of 2KHz. Moreover, Subg 

represents a IKHz interval containing the 0-lKHz input of the band-splitter 315. Subj 
represents a IKHz interval containing the l-2KHz input of the band-splitter 315. Note 
that, due to the decimation operation of the decimator 405 by a factor of 2:1, the 
20 frequencies in Subj will appear as their mirror images around IKHz. 

As a by-product of the subband decomposition operation, power computation units 
425a, 425b compute the energies of the subband signals as follows: 
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i.e. the sum of squares of the individual subband signals, where is the number of 
samples per subband for every input frame, hi this design, the frame size is 10msec. 
Therefore, at 2KHz sampling frequency, each subband has A^=20 samples. 

Note that in Fig. 4, the present invention takes advantage of the fact that the PS-UR 
5 band-splitter 3 1 5 is lossless in that the sum of the output energies of the subband signals 
is equal to the input energy. Therefore, the total energy of the signal is computed from 
£■() and by: 

E^ = E,+ E, (4) 

Having all the above information, the classifier 215 (Fig. 2) first checks to see if 
10 Ej> Sj, where s^. is the threshold below which the classifier 215 labels the current 

frame as having no signal in it. When Ej.>s^, the classifier 215 determines if there is 
signal in both subbands or only in one subband. These are established by comparing the 
pairs (^0' -^'r) and {E^, Ej). These comparisons are performed by the Signal Energy 
Assesment (SEA) unit 320 shown in Fig. 3. 
1 5 Below are tables that indicate how the frequency of each tone of interest is mapped 
into the subbands, Subo and Subi, when the sampling frequency is brought down to 
2KHz. 



DTMF 
Frequency (Hz) 


Subo - (Hz) 


Subi - (Hz) 


697 


697 




770 


770 




852 


852 




941 


941 




1209 




791 


1336 




664 


1477 




523 


1633 




367 



Table 1. DTMF frequencies in subbands. 
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MF-Rl 

Ftequenqr(H2) 


Subo - (Hz) 


Subj - (Hz) 


700 


700 




900 


900 




1100 




900 


1300 




700 


1500 




500 


1700 




300 



Table 2. MF-Rl frequencies in subbands. 





Frequencies for 
Detection 


Subo - (Hz) 


Sub, - (Hz) 


ANS+LEC DIS 


2100 




100+- 15 










V.21 


1650 




350 




1850 




150 










AA 


1800 




200 











Table 3. Subband mapping of all other tones. 



15 SUBBAND DECOMPOSITION 

The subband decomposition operation in Fig. 3 is achieved through Power 

Symmetric IIR (PS-IIR) filters 400 (Fig. 4), as shown in detail in Fig. 5 (see R.A. 

Valenzuela and A.G. Constantinides, "Digital Signal Processing Schemes for Efficient 

Interpolation and Decimation", IEEE Proc. Vol. 130, no. 6, pp. 225-235, Dec. 1983; and 
20 O. Tanrikulu, Adaptive Algorithms for Accelerated Convergence and Noise Immunity, 

PhD. Thesis, hnperial College, London, UK, 1995 the teachings of which are 

incorporated herein by reference in their entirety). 

Referring to Fig. 5, these PS-IIR filters 400 are based on the decomposition of a 

general elliptic BR filter into a special transfer function representation in terms of all- 
25 pass sections 5 1 Oa, 5 1 Ob. For the purposes of this embodiment of the design, the present 

invention uses the following structure: 
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where aoo. «oi' «iO' «ii are all-pass coefficients. These coefficients can be optimized to 
get a low-pass filter with a cut-off frequency at n/l rads. The design techniques are 
sophisticated and will not be discussed here (see O. Tanrikulu and M. Kalkan, "Design 
and Discrete Re-optimization of All-pass Based Power Symmetric UR Filters," 
Electronics Letters, vol. 32, no. 16, pp. 1458-1460, 1996, the teachings of which are 
included herein by reference in its entirety). Note that, once Hjjj^ is designed, a mirror 
image high-pass filter is obtained easily as 



' ' 1 + aooz-' 1 + a,,z-^ 1 + a,,z ^\ + a,,z 



1 0 Finally, the polyphase subband decomposition implementation for the band-splitter 315 
(Fig. 3) is obtained, as shown in Fig. 5, where: 



are cascades of first-order all-pass sections 415 (Fig. 4). Note that each of these sections 
510a, 510b are implemented by using the compact realizations of all-pass networks, 
1 5 which require only one memory location and one muUipUer for each all-pass section 
415 as should be understood from the equations above (see S.K. Mitra and K. Hirano, 
"Digital All-pass Networks", IEEE Trans. On Circuits and Systems, vol CAS-21, no. 5, 
pp. 688-700, Sept. 1974, the teachings of which are incorporated herein by reference in 
its entirety). 
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Using subband decomposition for the detection of the various telephony signals has 
a number of advantages. The foremost is, of course, the reduction of the sampling 
frequency to 2KHz, which reduces the computational complexity of the subsequent 
detection tasks in each subband. There are a number of other subtle differences 
5 compared to the implementations in the prior art. These other subtle differences have to 
do with the detection performance. For instance, in DTMF detection, the frequencies are 
grouped as F^={697, 770, 852, 941}, Fc = {1209, 1336, 1477, 1633} before subband 
division, and one frequency from each group defines a valid DTMF tone. Note that the 
subband decomposition isolates each group. Therefore, the detection problem then 

10 becomes searching for a single tone in each subband. 

In an implementation where the tone groups are not isolated as above, the validation 
of the tones is more difficult because of the difficulty associated with using notch filters 
to vahdate tones. In such an implementation, each notch filter computes its output 
signal energy. The ratio of the input energy to the output energy is used to accept or 

1 5 reject a tone. In a subband design, these ratios are going to be large since there is only 
one tone per subband for a DTMF signal. However, in a typical prior art implementation 
with no subband decomposition, the above-mentioned ratio is not so large since, if tone 
A € is notched, its output energy will contain tone B's e energy, and vice versa. 
The problem of this non-subband implementation is that when the twist requirements on 

20 the relative energies of tone A and tone B are tested, it is difficult to set thresholds. 

In a subband implementation according to the principles of the present invention, a 
primary level solution is to connect a group of four notch filters, 605a, 605b 
(collectively 605) to each subband signal, such as shown in Fig. 6 and described in co- 
pending U.S. Patent Application No. 09/696,730 filed October 25, 2000 by Tanrikulu et 

25 al, entitied "Method and Apparatus for Performing High Density DTMF, MF-Rl , MF- 
R2 Detection", incorporated herein by reference in its entirety. 

Note that the notch filters have the input/output transfer fimction: 
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l-2cos(^^) z-^+z-' 

N{z) = r ^ (8) 

l-2rcos(^y^) z-'+rV 

where r is the pole radius, /„ is the nominal notch frequency, and^ is the sampHng 
frequency. The filter gain is chosen as r, so that the gain of the notch fiher never 
exceeds unity across the spectrum, which is desirable for a fixed-point implementation, 
5 such as on a fixed-point DSP (e.g. Analog Devices® ADSP21 1 1 or Texas histraments 
TM5320C50). 

Following the notch filtering, the DTMF detection logic 610 is executed to 
determine whether the signal meets the criteria of a DTMF signal. 

The solution in Fig. 6 is still wasteful in the sense that the all eight notch filters 605 

1 0 are used at all times only to validate a DTMF tone that has only two frequencies in it. 
Therefore, if a mechanism is available to determine which two notch filters are relevant 
and which others serve no purpose at every point in time - subject to passing the signal 
energy assessment (SEA) test, then it is more efficient to run those two relevant notch 
filters only, rather than all eight. The mechanism to select which notch filters is/are 

1 5 relevant is called a preclassifier, described briefly in co-pending U.S. Patent Application 
No. 09/696,730 filed October 25, 2000, entitied "Method and Apparatus for Performing 
High-Density DTMF, MF-Rl, MF-R2 Detection" by Tanrikulu et al, the teachings of 
which are incorporated herein by reference in their entirety, and shown and described in 
detail in Fig. 7. 

20 PRECLASSIFIER DESIGN 

Although it is possible to identify the frequencies in a signal with multiple 
sinusoidal frequencies, this is generally computationally too intensive for the apphcation 
under consideration. The preclassifier performs frequency estimation on a given signal 
assuming that it is a single sinusoidal. This assumption is logical since DTMF signals 
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map into single sinusoidals in each subband, when there is a valid DTMF signal on the 
telephone line. If the signal is not a DTMF signal - say speech, the preclassifier returns 
an inaccurate frequency estimate. However, the notch filter that is assigned by the 
preclassifier to finally validate the DTMF tone will reject the signal, since the signal 
5 will not satisfy the bandwidth or the twist requirements of a DTMF signal 

Fig. 7 indicates how Fig. 6 is modified with the preclassifiers 705a, 705b to select 
relevant notch filters firom the banks of notch filters 605a, 605b; note that DTMF is 
again assumed for simplicity. 

Following the band-splitter 315 and signal energy assessment (SEA) unit 320, the 
10 preclassifiers 705 a, 705b performs second order Auto-Regressive AR(2) modeling on 
S^. The preclassifiers 705a, 705b are designed based on the fact that a narrow-band 
discrete-time signal can be generated by the difference equation 

y(n) = -a^y{n - 1) - a^y(n - 2) + win) (9) 

where w(n) is white noise. By multiplying equation (9) with y(n - i), i = 1,2 and taking 
1 5 the expectation, the following AR model equations are derived: 





1 


a, 










.«2. 




3. 



(10) 



where r. - E{y{n)y{n - /)} . Therefore, the unknown parameters can be solved directly 
as: 



a,=d{r,r^-r,r,) \ 
2 , where a = -r— 



, where d = — r- (11) 
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is the determinant of the correlation matrix. Once a, anda2 are solved, the roots of the 
characteristic polynomial are determined as (assuming the roots are complex, i.e. 

> ): 



1,2 



5 Note that if 4^^ < of , then the roots are real, and the frequency is OHz. This 
condition must be checked. 

Finally, the dominant frequency of the narrowband signal can be obtained by using 
an arctan(.) fimction on the real and imaginary parts of the roots above. The arctan(.) 
function is unable to resolve all four quadrants of the unit circle. However, the 
1 0 frequency (in radians) that is being sought can be uniquely determined by using the sign 
of «! as: 



27^-" (6) 

2n 



where 



g{x) = arctan(Vx), x = ^ (7) 



a, 



1 5 The variation of x with respect to the frequency of the sinusoidal to be estimated is 
illustrated in Fig. 8. 

To summarize the procedure just discussed, first, r^.r^.r^ are computed by 

averaging. Then, d, a^,a^ are computed from equation (11). Finally, the preclassifier 
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705a, 705b returns with the value of x . Subsequent detection functions, such as the 
DTMF, MF-Rl , . , . operate on x to determine which candidate frequencies are relevant. 

hi other words, there is no need to evaluate g(x) , which costs many instruction cycles 

on a DSP due to the square-root and ai*c-tangent functions. Instead, the frequency axis is 
5 divided into regions for each subband, and the DTMF frequency that is the closest to a 
given measurement of x is identified. The presence of a particular frequency is then 
verified by using a notch filter 710a, 710b (Fig. 7). The decision boundaries between 
the DTMF frequencies in subbands are given in Table 4. 



SubbandO 




697 




770 




852 




941 




Decision 
Boundaries 


660.5 




733.5 




811 




896.5 




985.5 


fl, 


>0 




>0 




>0 




>0 




>0 


X 


3.284 




1.231 




0.456 




0.114 




0.002 


Subbandl 




367 




523 




664 




791 




Decision 
Boundaries 


289 




445 




593.5 




727.5 




854.5 


ay 


<0 




<0 




>0 




>0 




>0 


X 


1.641 




32.830 




10.929 




1.329 




0.242 


Table 4. 


rMF frequencies, decision bounc 


aries and their corresponding x values. 



20 The decision boundaries and the values of x in Table 4 are based on assuming that 

there are no estimation errors involved in computing r^.r^.r^. However, when these 

quantities are estimated over very short windows of data, there are problems with the 
accuracy of the estimation. To understand these estimation errors better, the following 
signal model can be used 



25 



y(n) = V2 cos(^77 + ^) + w(n% n = 0,...7V - 1 

fs 
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where is the frequency to be identified, = IKHz is the sampUng frequency, 6* is a 
uniformly distributed random variable in {-k,7i] to simulate the edge effects, w{n) is 
white, Gaussian noise for different levels of SNR, and is the length of data available 
for estimation. 

5 It turns out that windowing the signal before calculating r^,r^, is usefiil, especially for 
small N (e.g. JV=20 for 10msec frame size). Therefore, a windowed version of the input 
signal frame is computed as 

y{n) = y{n)xh{n) (9) 

where h{n) is an iV-point Triangular window. Among Bartlett, Blackman, Hamming, 
1 0 Manning, and Traingular windows, the Triangular window provides the best results for 
the worst performing frequencies. 

The correlation coefficients, r^,r^,r^, are estimated over a given frame of data as 

^o = li3^'(«X r,^Y.y^n)y{n + l), r, = Y^y{n)y{n + 2) (9) 

Note that it is also important to average the estimated correlation values across 
1 5 frames of data, which can be done by 

rXp) = arXp-l) + (l-a)r„ / = 0,1,2, 0<a<1.0 (10) 

and use r'ip) to estimate parameter x for a given window. 

DTMF logic 715 is employed to determine whether the received signal is a valid 
DTMF frame. Details of the DTMF logic 715 are discussed later in reference to Fig. 13. 
20 Figs. 9A and 9B are plots of experimental resuhs for DTMF frequencies 941Hz and 
770Hz for 20 (10msec framesize) for twenty Monte Carlo trials. 
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The windowing and averaging across frames help to keep the estimated frequencies 
in packed-around clusters. However, it can be seen that the estimated frequencies are 
biased from the ideal frequencies. This is due to the value of iV being too small. When 
A'^ is increased, experiments confirm that this bias goes to zero. Therefore, the entries in 
Table 4 are not directly used to classify DTMF frequencies, histead, new x values are 
computed based on Monte-Carlo simulations. The nominal frequencies, their biased 
estimates, the decision boundaries, the sign of ai, and the value of x are tabulated in 
Tables 5 and 6. Note that, here the decision boundaries are defmed as the middle points 
between the biased estimates. 



SubbandO 


Biased 
Estimates 


Decision 
Boundary 




X 


Subbandl 


Biased 
Estimates 


Decision 
Boundary 


aj 


X 


697 


694-1512 








367 


369.0281 












729.5704 


>0 


1.2936 






445.8795 


<0 


33.9273 


770 


764.9896 








523 


522.7308 












803.8430 


>0 


0.5016 






592.1336 


>0 


11.2751 


852 


842.6965 








664 


661.4616 












879.4865 


>0 


0.1582 






723.5534 


>0 


1.5222 * 


941 


916.2765 








791 


785.1352 









10 



15 



values, including 1.5% of each DTMF frequency, N=20 (10msec frame size). slightly 
larger than theoretical value. Used to pass the netcore test. 



SubbandO 



Biased 
Estimates 



Decision 
Boundary 



Subbandl 



Biased 
Estimates 



Decision 
Boundary 



X 



700 



697.0104 



300 



302.9915 



793.4531 



>0 



0.5752 



401.5480 



<0 



9.7931 



20 



900 



889-8957 



500 



500.1046 



598.5575 



>0 



9.7707 



700 



697.0104 



793-4531 



>0 



0.5752 



900 



889.8957 



Table 6. PreClassifier decision regions for MF-Rl and their corresponding x values, 
including 1.5% of each MF-Rl frequency, N=20 (10msec frame size). 

Note that the frequency estimation procedure is repeated for a range of frequencies 
between 20Hz and 980Hz for N=20 (10msec frame size), and the results are shown in 
25 Fig 10, 

Finally, since windowing adds to the computational complexity, its effect on 
detection performance is found to justify its computational complexity by repeating the 
experiment for Fig 10. The results in Fig, 1 1 illustrate quite clearly the necessity to use 
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windowing. Otherwise, there are large errors when the frequencies are above 900Hz or 
below lOOHz. 

MAIN CONTROL CODE 

At the output of the band spht unit 315 in Fig. 3, there is decision logic, referred to 

5 as a signal energy assessment (SEA) unit 320, that governs the rest of the operations in 
the classifier (CLS) 215 (Fig. 2). Different actions are taken based the results of the 
signal energy assessment (SEA). Recall that one objective is to achieve the minimum 
computational complexity while still applying to many different tone protocols. The 
least efficient approach would be to go and check the presence of DTMF, MF-Rl, ANS, 

1 0 etc. in a sequential manner. However, there is sufficient amount of a priori information 
to split this detection task into smaller pieces. 

The first piece of a priori information is, of course, the fact that a DTMF digit 
comprises two tones. From the definition of these tones and the way the signal is split 
into subbands, it is clear that a valid DTMF digit results in energy comparisons in the 

1 5 SEA 320 that point out significant energy in both subbands. If this is not the case, then 
there is no point in running a DTMF detector. If the SEA 320 determines energy in both 
subbands, the DTMF detector in (see co-pending U.S. Patent AppHcation No. 
09/696,730 filed October 25, 2000, entitled "Method and Apparatus for Performing 
High-Density DTMF, MF-Rl, MF-R2 Detection" by Tanrikulu et al, the teachings of 

20 which are incorporated herein by reference in their entirety) is called and used with only 
two notch filters - one per subband - to validate the tones. 

The second piece of a priori information is for the MF-Rl detection. In this 
detection, the subband division splits the firequencies into the groups of Fr={700, 900} 
in subband 0, and Fr={300, 500, 700, 900} in subband 1. Clearly, if the SEA 320 

25 detects signal energy in Subband 0 only, there is no point in using the preclassifier 705a 
(Fig. 7) since the candidate firequencies are 700Hz and 900Hz, and these can be verified 
by using notch filters. If there is a signal in both subbands, then two preclassifiers 705a, 
705b are necessary to determine the relevant notch filters. If there is signal in only 
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Subband 1, what the preclassifier 705b returns is not directly useful for MF-Rl 
detection, but for other purposes, as described later. 

The third piece of a priori information is that all other detection tasks, ANS, V.21, 
AA, LEC_DIS are handled in Subband 1 according to Table 1. When processing the 

5 signals in Subband 1 , the preclassifier 705a is used to differentiate between the 
possibihties of having MF-Rl tones, {V.21, AA} tones, or {ANS, LEC_DIS} tones, 
depending on the value of x returned by the preclassifier 705b. A value of x = 0.204 
corresonding to 135Hz marks the boundary between {ANS, LEC_DIS} and {V.21, 
AA} . Similarly, a value of x = 5.8284 corresponds to 375Hz. This marks the boundary 

1 0 between MF-Rl tones in Subband 1 and { V.2 1 , AA} . From Table 2 and Table 3, it 
appears that the 375Hz boundary has a contradiction in that the 1700Hz MF-Rl tone, 
which maps to 300Hz, and the 1650Hz part of the V.21, which maps to 350Hz, cannot 
be separated. This is not an issue since, when there is a valid MF-Rl signal in Subband 
1, it will consist of two tones, and, therefore, the preclassifier 705 will identify a 

1 5 frequency that is much higher than 300Hz - m fact, around 400Hz, which is the center of 
gravity of 300Hz and 500Hz. 

Equipped with all the above a priori knowledge, the classifier 215 uses a main 
control process, whose flowchart is given in Fig. 12. 

INDIVIDUAL DETECTORS 
20 Fig. 12 is a flow diagram of an embodiment of the main control process 1200 
executed by the high density signal classifier, CLS 215. 

The main control process 1200 begins in step 1205. hi the next step, the band-split 
unit 315 separates the received signal into two subband signals, Sq and Sj, as described 
above. Following the band-split unit 315, the signal energy assessment unit (SEA) 320 
25 processes the subband signals. 

In step 1210 in the SEA 320, the process 1200 makes a comparison between the 

total energy {Ej = Eq + E,) of the signal and a pre-determined threshold, Ej, as described 
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above in reference to equation (4). If the total energy, E^, of the received signal is 
greater than the energy threshold, Ej, then processing continues in step 1215. 

Step 1215 determines if there is energy in both subbands. If there is energy in both 
subbands, then the signal is determined to be either a DTMF signal or MF-Rl signal and 

5 processing continues in step 1220. hi step 1220, the values of the variables Xq and x, are 
determmed by the Preclassifiers 705a, 705b (Fig. 7) and Table 4. Next, in step 1225, a 
DTMF/MF-Rl detector uses the values of Xq and x„ along with Sq, Sj, Eq, E, to 
determine whether the received signal is DTMF or MF-Rl . DTMF detector portion of 
step 1225 is depicted in Fig. 13; the MF-Rl portion of step 1225 is depicted in Fig. 14. 

10 Continumg to refer to Fig. 12, if the total energy of the received signal is less than 
the energy threshold, £j, then processing continues in step 1230. In step 1230, if the 
received signal is determined to have energy only in Subband 0 but not Subband 1, then 
the received signal can only be an MF-Rl signal with 700Hz or 900Hz, and processing 
continues in step 1235, shown in detail in Fig. 14. 

15 Continuing to refer to Fig. 12, if step 1230 determines the received signal has energy 
only in Subband 1, then the signal may be ANS, AA, V.21, or MF-Rl in Subband 1. hi 
this case, processing continues in step 1240, where the value of variable Xj is 
determined by the preclassifier 705b (Fig. 7) and Table 4. If the corresponding, 
precalculated frequency, f,(Xi), is less than 375Hz (step 1245), then processing 

20 continues in step 1250; otherwise, the MF-Rl detector (Fig. 14) is executed in step 
1265. 

If f,(xi) is less than 135Hz (step 1250), then an ANS detector is executed in step 
1255, as described in detail later. If fc(x,) is greater than 135Hz, then an AA detector is 
executed in step 1260, also described later (in reference to Fig. 15). 
25 There are some similarities with DTMF and MF-Rl detection techniques in some 
recent designs described in co-pending U.S. Patent Apphcation No. 09/696,730 filed 
October 25, 2000 by Tantikulu et al, entitled "Method and Apparatus for Perfoming 
High Density DTMF, MF-Rl , MF-R2 Detection", incorporated herein by reference in 



2376.2003-000 



-25-- 



its entirety. The designs in Tantikulu et al. do not focus on the preclassifier 705, and, 
also, the designs are isolated implementations of DTMF and MF-Rl detection. 
Therefore, in the DTMF case, previous designs were mainly directed to utilizing eight 
notch filters, instead of the two notch filters and preclassifier 705 as in the present 
5 invention. Similarly, the previous MF-Rl detector designs utilize six notch filters, 
whereas the current implementation of the signal classifier uses only two notch filters 
and a preclassifier in Subband 1, if required. 

DTMF DETECTION 

As mentioned above, this DTMF detector of the present invention uses a structure, 

10 as in Fig. 7, rather than the structure in Fig. 6. It turns out that since the implementation 
of a preclassifier 705 consumes fewer instruction cycles than the implementation of 
three notch filters, the design in Fig. 7 is more efficient in computational complexity. 
Furthermore, the design of the present invention is more memory efficient, since 
memory is reserved for each of the eight notch filters in Fig. 6, but stores only two notch 

15 filter histories. The preclassifier 705, and the modification of Fig. 6 to Fig. 7 with fewer 
instruction cycles and less memory, are distinguishing aspects of the present invention. 

The DTMF detection logic 715 of the DTMF detector in Fig. 7 has similarities to 
what is described in co-pending U.S. Patent Application No, 09/696,730 filed October 
25, 2000 by Tantikulu et aL, entitled "Method and Apparatus for Performing High 

20 Density DTMF, MF-Rl , MF-R2 Detection", incorporated herein by reference in its 

entirety, but it also has new aspects to do with the presence of the preclassifier 705. The 
flowchart of Fig. 13 describes the DTMF detection logic 715 in the present invention 
classifier 215 (Fig. 2). Note that the order of the steps in this flowchart can be altered 
without significantly changing the operation of the DTMF detector. Further note that Sq, 

25 iSj, E^, ^1 ^i*^ inputs to the DTMF detector. 

Continuing to refer to Fig. 13, the DTMF detector logic 715 begins in step 1305. In 
step 1310, the process determines a first notch filter based on Xq. In step 1315, the first 
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notch filter is applied to the Subband 0 signal, Sq. hi step 1320, the process determines 
a second notch fiher based on x^. In step 1325, the second notch filter is applied to the 
Subband 1 signal, S^. The notch- filtered signals, Sq and Sj, are analyzed by a twist test 
1330 and a bandwidth test 1335. 
5 The Twist-Test (TT) 1330 is used to determine whether 



is satisfied in accordance with the international standards, which means that the energies 
of the row and column fi-equencies of a DTMF signal are within certain levels of each 
other. 

10 The Bandwidth-Test (BT) 1335 makes sure that if the frequency deviations on the 
tones are within 1.5% of the nominal frequency, they are declared as valid; and, if the 
frequency deviations are outside of the 3.5% of the nominal frequency, they are declared 
as invalid. This is accomplished by checking the input-output energy ratio of the notch 
fihers used, hi other words, if the output energy for the p-th notch filter in the q-th 

15 subband is E^ q, and the notch filters i and j are selected by the preclassifiers 705, then 
the BT 1335 is 



Note that in Fig. 13, if the TT 1330 and BT 1335 are passed, then the current frame of 
data is declared as a valid DTMF frame (step 1 340). However, these frame-based decisions 
20 are monitored over time to declare vahd DTMF digits. Such decisions have to do with the 
minimum duration and short periods of signal drop-outs encountered in DTMF signals. 



E • 

— ~>Sf, and 
F 




(12) 



MF-Rl DETECTION 
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Referring now to Fig. 14, the MF-Rl detector is called (steps 1225, 1235, 1265) in 
three different ways in the flow diagram of Fig. 12 depending upon how the SEA 320 
labels the signal presence in subbands. Three conditions exist, and these are described 
below. 

5 Referring to Fig. 14, if the signal energy assessment (SEA) unit 320 determines 
there is signal in both subbands (step 1215, Fig. 12), the MF-Rl detector is called (step 
1225), and a TT 1415 is performed to make sure that the difference between signal 

levels in the two subbands are as in equation (11) with different (£[ , ) values, 
specific to MF-Rl detection. Then, the preclassifier results, which are already available 
10 (i.e., predetermined), are used with the decision regions in Table 6 to select one notch 
filter per subband (steps 1420, 1425). These notch filters are then used (step 1430) to 
validate the presence of a valid MF-Rl signal in a given frame of data through a 
bandwidth test 1435, which is again a ratio of the output energy to the input energy. 

Note that the thresholds (S^, Sy) are again different than in equation (12). If the 

15 bandwidth test 1435 is met, then a valid MF-Rl frame is reported in step 1485. 

Continuing to refer to Fig. 14, in the second case, the SEA 320 labels the current 
frame as having energy only in Subband 0 (step 1230, Fig. 12), and processing 
continues in step 1235. From Table 6, it is known that there are only two candidate 
frequencies, 700Hz and 900Hz, in this case, hi this case, notch filters are apphed (step 

20 1440) and a Noise-Test (NT) 1445 is performed to reject noisy signals, where the noise 
test involves 

F 4- F ^ 

^ 0,(700 Hz) ^ ^0,(900 Hz) ^-i 

This test makes sure that the energies are concentrated around 700Hz and 900Hz only. 
After the NT 1445, a TT 1450 is performed to rule out if a single tone is present only. 
25 This is again of the form 
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(14) 



Finally, a BT test 1455 is performed which is of the form 

—^5 — Is" and — >5" (15) 

F F 

^OXlOOHz) ^0,(900i/z) 

If the BT test 1455 is met, then a vahd MF-Rl frame is reported in step 1485. 
5 Continuing to refer to Fig. 14, as the third condition, the SEA 320 labels the current 
frame as having signal energy only in Subband 1 (step 1230, Fig. 12), and processing 
continues in step 1265. In this case, the possibilities are 300Hz, 500Hz, 700Hz, and 
900Hz according to Table 6. The detection procedure (steps 1240, 1245, 1265, Fig. 12) 
is as follows. First, four notch filters are instantiated (step 1460) and the output 
10 energies are obtained. Then, the two frequencies that have the lowest output energy 
(step 1465) are labeled as Ej^^ and Ej^j. Then, the tests in the second case above are 
repeated (steps 1470, 1475, 1480) at these frequencies using and Ej^^ and also Ej. If 
the tests 1470, 1475, and 1480 are met, then a valid MF-Rl frame is reported in step 
1485. 



15 AA DETECTION 

The signal AA is an 1800Hz tone that maps into 200Hz in Subj according to Table 
3. AA detection is activated if the SEA 320 determines there is signal only in Sub^ 
(steps 1240, 1245, 1250, 1260, Fig. 12), if the preclassifier return value indicates a 
candidate tone with a frequency between 135Hz and 375Hz. If these conditions are 

20 satisfied, a notch filter at 200Hz (not shown) is used to filter the signal in Subj, and a 
BT test (not shown) is performed that checks to see if 



7 



(200 Hz) 



(16) 
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V.21 DETECTION 

If the AA detector returns with a FALSE decision, then a V.21 detector is initiated 
(step 1260). Referring to the V.21 detector 1500 of Fig. 15, the V.21 signal is a binary 
FSK signal with frequencies at 1650Hz and 1850Hz. At the 2KHz sampling rate of the 
classifier 215 structure, these firequencies map into 350Hz and 150Hz respectively. The 
baud rate of V.21 is 300bits/sec. hi the VoIP apphcation where the classifier 215 is 
used, the objective of the V.21 detector is to detect FAX calls over the system and 
switch the system into the appropriate mode. Note that, in a FAX call, the V.21 signal 
will always start with the repeating bit sequence of . . .001 11111.... hnplementing a fiill 
FSK demodulator is computationally too expensive, since 300 bits/sec corresponds to 
each bit being represented by 2000/3000=6.67 samples/bit. Therefore, an interpolation 
filter is needed. However, note that the obj ective is just to detect ... 00 1 1 1 1 1 1 not to 
demodulate any arbitrary V.21 signal. Therefore, an approximate non-coherent 
demodulator can be used, where it is assumed that samples/bit = 7, as indicated in a 
delay 1505 of Fig. 15. 

Continuing to refer to Fig. 15, a LPF 1510 is a single-pole IIR filter with the transfer 

function 

1 -a (^^^ 
LPF = a=0<a<l 

1 -a 

A typical output of the LPF 1510 is shown in Fig. 16. Note that the decision logic 
1515 first looks for 16 positive values to locate the beginnmg of a meaningfiil signal. 
This corresponds to the initial pile of positive numbers before the first zero-crossing in 
Fig. 16. Then, once the positive values are detected, then the first zero-crossing is 
located as the first transition firom positive to negative. After the zero-crossing, a 
countdown is initiated fi-om 3 to zero to get the first bit. Then, since the ratio of 
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samples^it is not an integer, countdowns from 1 3 to zero are used to get the other bits. 
This essentially corresponds to detecting a decimated version of the bit sequence 

.001 11 1 1 1 ... as ... 01 11 ... . After each detection of 01 1 1 , the detector is reset to detect 
the next zero-crossing again. As long as . , .01 1 10. . . are detected, it is determined that a 
5 V.21 signal is present. Note that, due to samples^it being a fractional number, the 
structure in Fig. 1 5 will occasionally miss the bit pattern ... 01 1 1 0. but since only the 
detection of the beginning of a V.21 is required, the above structure should be sufficient. 
Note that, some FAX machines signal the sequence ...01111110.... The above 
described mechanism is sufficient to detect again .. .01 1 1 ... on this sequence. 



1 0 ANS TONE AND PHASE REVERSAL DETECTION 

The ANS tone is a tone commonly used in data modems. The classifier 215 (Fig. 2) 
detects this 2100Hz tone (steps 1215, 1230, 1240, 1245, 1250, Fig. 12), which maps 
into Subj at lOOHz. The detection of this tone is through a notch filter at lOOHz (not 
shown), and a BT test (not shown) is performed for this purpose that checks to see if 



15 Y^^>Sans (18) 

^1,(100 ife) 



if this is true for a particular frame that is noted in memory. In modems, such as V.34 
and V.90, there are phase reversals imposed on the ANS Tone, These phase reversals 
essentially make the signal wide-band at the moment of phase reversal. Therefore, an 
ANS detector with a notch filter would fail to recognize such a frame as a valid ANS 

20 tone. The frequency-domain dispersion of the signal due to the phase reversal dissipates 
one 10msec frame. Therefore, the decisions for ANS detection are made over two 
frames, and if the decisions are 1 1 or 01 or 10, a valid ANS tone is still declared. Only if 
the decisions are 00, the CLS 215 (Fig. 2) declares there is no ANS tone. 

The detection of the phase reversal is de-coupled from the detection of the ANS 

25 tone, but existence of phase reversals are checked only if a vahd ANS tone has been 
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detected. The detection of the phase reversal is based on computing a normalized 
crosscorrelation coefficient between successive periods of a 2100Hz tone. As noted 
above, 2100Hz maps into lOOHz in Sub^, where the sampling frequency is 2ICHz, So, a 
single period of this signal is 20 samples, or lOmsec. This happens to coincide with the 
5 frame size that is used in the subband classifier 215 (Fig. 2). The advantage is that one 
period of the 2100Hz tone exactly fits into one frame of data in Sub^. The basis of the 
detection of phase reversals is illustrated in Fig. 17 

The normahzed cross-correlation coefficient over two consecutive frames of data 
- frame X and frame F- for signal s(n) is 



1 n+A^-l 

10 r^j-— J]s(j)sii-N) (19) 

where 

cr'=i;^'(0 (20) 

i=n-N 

is the energy computed over the two frames of data, X and Y. From equations (19), 
(20), and in Fig. 17, if Z = A,Y = B, then r^^ = 0.5. Similarly, if X = 5, 7 - C, then 

15 J. = -0.5. Finally, beyond the phase reversal, if X= C, 7 = D, then r^^^y ^0.5 again. 

Therefore rj^j will dip negative during a phase reversal and the following condition is 
used to detect it 



20 



^xj < (21) 
The minimum negative value will depend upon the amount of phase reversal in 
degrees. Another feature of this detector is that it is non-coherent, meaning, if the ANS 
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tone has some frequency shift, the phase reversal detector is not affected by it. Finally, 
since the original 8KHZ signal is put into subbands, if the original signal has wide-band 
noise superimposed, the band-splitting operation rejects the noise in the lower half of 
the spectrum, which increases the Signal-to-Noise Ration (SNR) by 3dB. Note that, for 

5 each phase reversal, r^^j stays negative for a number of samples. Therefore, a lockout 
time of 20msec is used to make sure that the same phase reversal is not detected twice. 

A number of plots are given below that illustrate the variation of ^ under 

different conditions. These conditions exercise various phase reversal amounts and 
SNRs. 

10 Extended tests have been performed, where 250 phase reversals are introduced for 
different amounts of phase reversals. The results are presented in Tables 7 and 8. hi the 
first test in Table 7, all phase reversals are detected, whereas in Table 8, none of the 



phase reversals quaUfy as valid for detection, and therefore must be rejected. 
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Table 7, Accept Test Results. 
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Table 8. Reject Test Results. 



30 Note that, the detection of the phase reversals is accurate even at 5dB SNR. 

However, it should be noted that the above results were obtained with a MATLAB® 
equivalent of the phase reversal detection algorithm. In the real (i.e., non-simulated) 
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classifier 215, the detection performance worsens below 1 IdB SNR. This is because, for 
such low SNR values, the SEA 320 in the classifier 215 decides there is signal in both 
subbands, but the existence of phase reversals or the 2100Hz tone are not checked under 
that condition. This is due to keeping the complexity to a minimum. If detection below 
5 1 IdB SNR is required, the ANS tone and phase reversal detections must be called if 
SEA 320 decides there is signal in both subbands. Note that, in this case, the 
preclassifier decisions may be used to differentiate between DTMF/MF-Rl and ANS 
tone situations. This is an open issue for further exploration. 

ALGORITHMIC COMPLEXITY AND MIPS ESTIMATES 
10 Table 9 is a table of the processes that are called covering all the possibihties 
described in Figs. 12-17 above. 



EO >= 
EO_thrshld 


El >=El_thrshld 


Blockwise Complexity 


TRUE 


TRUE 


HPF+BS+PCo+PCj+2N (DTMF)+2N(MF-R1) 


TRUE 


FALSE 


HPF+BS + 2N (MF-Rl,700H2,900Hz) 


FALSE 


TRUE 


HPF+BS+PCi+ 4N (MF-Rl) - (PC, > 375 Hz) 
Or 

HPF+BS+PCi+N(ANS) - ( 135Hz < PCi < 375 Hz) 
Or 

HPF+BS+PCi+N(AA) +2N (V.21) -(PCi < 135Hz) 



Table 9, Algorithmic Complexity, 



where 

BS: is the band-split, 
20 PCo 1 : is the preclassifier running on subbandO and subbandl , 
N: is a notch filter. 

As indicated by Table 9, the overall algorithmic complexity is gated by the first 
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case. Therefore, the complexity of the classifier 215 is HPF+BS + 2PC + 4N. Note that 
the HPF 310 is running at 4KHz sampling fi-equency. 

The following description provides estimates of some MIPS numbers for DTMF 
detection by the classifier 215. Based on a recent design an efficient DTMF detector. 
5 The complexity for DTMF is BS+2PC+2N. Note that the affect of frame sizes on cycle 
counts is ignored in heu of those measured during testing of the efficient DTMF 
detector; namely, for 10msec, 6 cycles/sample for notch, and 8 cycles/sample for the 
band-splitter 315. It is also assumed that there are 20 cycles of overhead associated with 
band splitting and notch filtering operations (setup + epilogue + prologue). 
10 Preclassifying is approximately (Idivs + 3*20 cycles (r(, rj, r2)+20 

cycles(windowing)+3 cycles(alpha filtering on Tq rj, r2)+12 cycles (add,mul,sub)) = 125 
cycles per 10msec fi-ame - assuming 30 cycles for a division. Therefore, the total is 
(8*2000+2*6*2000+125*100) = 52500 cycles/sec. By the same calculation method, the 
DTMF detector implemented as in Fig. 6 is BS+8N = 1 12000 cycles/sec. Note that the 
1 5 effect of the high-pass filter 310 (Fig. 3) is ignored. The high pass filter 310 which 
would make the difference less but not by a large factor. Therefore, the classifier 215 
equipped with the preclassifier 705 is a more efficient embodiment than using a notch 
filter 605a, 605b (Fig. 6) for every frequency of interest. 

As a first reference point for the reduction in complexity of the classifier 215 
20 employing the principles of the present invention, in a recent implementation, the 
classifier 215 executed on a Texas histruments® TMS320C54x DSP with 0.7 MIPS, 
whereas a traditional classifier would execute in 7 MIPS or more on the same DSP. As 
a second reference point, in another recent implementation, the classifier 215 executed 
on a TMS320C6X DSP in 0.3 MIPS, whereas a traditional classifier would execute in 3 
25 MIPS or more on the same DSP. The complexity typically translates proportionally to 
the number of instructions and instruction cycles. It should be understood that the MIPS 
(i.e., execution speed) is dictated, in part, by the compiler used to compile the classifier 
software, the architecture of the processor, the pipelining efficiency of the processor, the 
clock speed, and other typical variables affecting the computational efficiency of the 
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DSP or other processor. 

While this invention has been particularly shown and described with references to 
preferred embodiments thereof, it will be understood by those skilled in the art that 
various changes in form and details may be made therein without departing from the 
5 scope of the invention encompassed by the appended claims. 



